Method and system for a common-attribute virtual desktop infrastructure (vdi) environment with tiered data processing unit (dpu) capabilities

ABSTRACT

A method for managing a virtual desktop infrastructure (VDI) environment includes: obtaining a plurality of target resource specific pool specific configuration templates for a target resource, in which each of the plurality of target resource specific pool specific configuration templates is associated with one or a plurality of virtual desktop (VD) pools, in which the target resource is a data processing unit (DPU); obtaining a common configuration template set; generating a VD pool configuration for each of the plurality of VD pools using the plurality of target resource specific pool specific configuration templates and the common configuration template set to obtain a plurality of VD pool configurations; selecting a default VD pool from the plurality of VD pools; and deploying, based on the selection, a plurality of VDs into the default VD pool using a VD pool configuration associated with the default VD pool.

BACKGROUND

Computing devices may provide services. To provide the services, thecomputing devices may include hardware components and softwarecomponents. The software components may store information usable toprovide the services using the hardware components.

BRIEF DESCRIPTION OF DRAWINGS

Certain embodiments of the invention will be described with reference tothe accompanying drawings. However, the accompanying drawings illustrateonly certain aspects or implementations of the invention by way ofexample, and are not meant to limit the scope of the claims.

FIG. 1 shows a diagram of a system in accordance with one or moreembodiments of the invention.

FIG. 2 shows a diagram of a back-end device in accordance with one ormore embodiments of the invention.

FIG. 3 shows a relationship diagram illustrating the computing resourcesutilized by a virtual desktop (VD) pool in accordance with one or moreembodiments of the invention.

FIGS. 4.1 and 4.2 show a method for managing a common-attribute virtualdesktop infrastructure (VDI) environment with tiered data processingunit (DPU) capabilities in accordance with one or more embodiments ofthe invention.

FIGS. 5.1-5.3 show an example back-end device in accordance with one ormore embodiments of the invention.

FIG. 6 shows a diagram of a computing device in accordance with one ormore embodiments of the invention.

DETAILED DESCRIPTION

Specific embodiments of the invention will now be described in detailwith reference to the accompanying figures. In the following detaileddescription of the embodiments of the invention, numerous specificdetails are set forth in order to provide a more thorough understandingof one or more embodiments of the invention. However, it will beapparent to one of ordinary skill in the art that the one or moreembodiments of the invention may be practiced without these specificdetails. In other instances, well-known features have not been describedin detail to avoid unnecessarily complicating the description.

In the following description of the figures, any component describedwith regard to a figure, in various embodiments of the invention, may beequivalent to one or more like-named components described with regard toany other figure. For brevity, descriptions of these components will notbe repeated with regard to each figure. Thus, each and every embodimentof the components of each figure is incorporated by reference andassumed to be optionally present within every other figure having one ormore like-named components. Additionally, in accordance with variousembodiments of the invention, any description of the components of afigure is to be interpreted as an optional embodiment, which may beimplemented in addition to, in conjunction with, or in place of theembodiments described with regard to a corresponding like-namedcomponent in any other figure.

Throughout this application, elements of figures may be labeled as A toN. As used herein, the aforementioned labeling means that the elementmay include any number of items, and does not require that the elementinclude the same number of elements as any other item labeled as A to N.For example, a data structure may include a first element labeled as Aand a second element labeled as N. This labeling convention means thatthe data structure may include any number of the elements. A second datastructure, also labeled as A to N, may also include any number ofelements. The number of elements of the first data structure, and thenumber of elements of the second data structure, may be the same ordifferent.

Throughout the application, ordinal numbers (e.g., first, second, third,etc.) may be used as an adjective for an element (i.e., any noun in theapplication). The use of ordinal numbers is not to imply or create anyparticular ordering of the elements nor to limit any element to beingonly a single element unless expressly disclosed, such as by the use ofthe terms “before”, “after”, “single”, and other such terminology.Rather, the use of ordinal numbers is to distinguish between theelements. By way of an example, a first element is distinct from asecond element, and the first element may encompass more than oneelement and succeed (or preceded) the second element in an ordering ofelements.

As used herein, the phrase operatively connected, or operativeconnection, means that there exists between elements/components/devicesa direct or indirect connection that allows the elements to interactwith one another in some way. For example, the phrase ‘operativelyconnected’ may refer to any direct connection (e.g., wired directlybetween two devices or components) or indirect connection (e.g., wiredand/or wireless connections between any number of devices or componentsconnecting the operatively connected devices). Thus, any path throughwhich information may travel may be considered an operative connection.

In general, computing resources of pooled virtual desktops (VDs) sharethe same (e.g., fixed) configuration parameters (e.g.,common-attributes) in a virtual desktop infrastructure (VDI)environment. These environments can be referred to as common-attributeVDI environments. Usually this sharing of common-attributes may limitthe computing resources’ production workloads (e.g., reading data from atarget table, writing data from the target table, etc.) of the pooledVDs. Embodiments of the invention relate to methods and systems formanaging a common-attribute VDI environment with tiered DPUcapabilities. More specifically, various embodiments of the inventionmay obtain a plurality of target resource specific pool specificconfiguration templates (also referred herein simply as “a target L2template”) and a common configuration template set. A VD poolconfiguration may then be generated for each of a plurality of VD poolsusing the obtained plurality of target resource specific pool specificconfiguration templates and the obtained common configuration template.A default VD pool from the plurality of VD pools may then be selected.Based on the selection, a plurality of VDs may be deployed into thedefault VD pool using a VD pool configuration associated with thedefault VD pool. Each of the plurality of VDs in the default VD pool maythen be monitored to obtain target resource utilization information(e.g., DPU utilization information). Based on the obtained targetresource utilization information, at least one migration criterion ofthe default VD pool may be satisfied. Based on the satisfied migrationcriterion, if one or more VDs require an additional target resourceavailability, the one or more VDs may be migrated to another VD pool ofthe plurality of VD pools with a higher target resource availability.This advantageously provides a reconfigurable common-attribute VDIenvironment that is able to dynamically adapt to users’ requirements(e.g., requirement of more or less DPU availability for executingprocesses) and/or technical considerations (e.g., security, performance,and storage policies considerations) of a data center.

In the context of one or more embodiments disclosed herein, the term“storage” refers specifically to persistent storage (e.g., non-volatilestorage such as, but is not limited to: disk drives, solid-state drives(SSDs), etc.) while the term “memory” refers specifically tonon-persistent memory (e.g., volatile memory such as, but is not limitedto: random access memory (RAM)).

The following describes various embodiments of the invention.

FIG. 1 shows a diagram of a system (101) in accordance with one or moreembodiments of the invention. The system (101) includes front-enddevices (100), a network (120), back-end devices (130), and a servicemanager (140). The system (101) may include additional, fewer, and/ordifferent components without departing from scope of the invention. Eachcomponent may be operably connected to any of the other component viaany combination of wired and/or wireless connections. Each componentillustrated in FIG. 1 is discussed below.

In one or more embodiments of the invention, the system (101) mayprovide computer-implemented services (e.g., real-time networkmonitoring, backup and disaster recovery, server virtualization, etc.)to users. To provide the computer-implemented services to the users, thesystem (101) may perform computations locally (e.g., at the users’ siteusing the front-end devices (100)) and remotely (e.g., away from theusers’ site using back-end devices (130)). By doing so, the users mayutilize different computing devices (e.g., 600, FIG. 6 ) that havedifferent quantities of computing resources (e.g., processing cycles,memory, storage, etc.) while still being afforded a consistent userexperience. For example, by performing computations remotely, the system(101) may maintain the user experience provided by the differentcomputing devices even when the different computing devices possessdifferent quantities of computing resources.

In one or more embodiments of the invention, to provide theaforementioned computer-implemented services, the system (101) mayinclude any number of front-end devices (front-end device A (100A),front-end device N (100N)). In one or more embodiments, the front-enddevices (100) may be utilized by the users and/or administrators (e.g.,a user with permission to make changes on a computing system that willaffect other users of that computing system). However, not all of theusers and/or administrators may be allowed to receive all of theaforementioned computer-implemented services.

In one or more embodiments of the invention, different front-end devices(100A, 100N) may have different computational capabilities. For example,the front-end device A (100A) may have 16GB dynamic random-access-memory(DRAM) and 1 central processing unit (CPU) with 12 cores, whereas thefront-end device N (100N) may have 8GB persistent memory (PMEM) and 1CPU with 16 cores. Other different computational capabilities of thefront-end devices (100) not listed above may also be taken into accountwithout departing from the scope of the invention.

In one or more embodiments of the invention, to provide a consistentuser experience, the front-end devices (100) may use a virtual desktopinfrastructure (VDI) environment or other types of computingenvironments that enable remote resources (e.g., back-end devices (130))to provide computer-implemented services. In one or more embodiments,the computer-implemented services provided by the back-end devices (130)may appear (to the users) to be provided by the front-end devices (100).In this manner, for example, the front-end devices (100) may be capableof: (i) collecting users’ input; (ii) correlating the collected users’inputs to functionalities of the computer-implemented services to beprovided to the users; (iii) communicating with the back-end devices(130) that perform the computations necessary to provide thefunctionalities of the computer-implemented services; and (iv) using thecomputations performed by the back-end devices (130) to provide thecomputer-implemented services in a manner that appears to be performedlocally to the users.

In one or more embodiments of the invention, the front-end devices (100)may be implemented as computing devices (e.g., 600, FIG. 6 ). Acomputing device may be, for example, a mobile phone, a tablet computer,a laptop computer, a desktop computer, a server, a distributed computingsystem, or a cloud resource. The computing device may include one ormore processors, memory (e.g., RAM), and persistent storage (e.g., diskdrives, SSDs, etc.). The computing device may include instructions,stored on the persistent storage, that when executed by the processor(s)of the computing device cause the computing device to perform thefunctionality of the front-end devices (100) described throughout thisapplication.

Alternatively, in one or more embodiments of the invention, thefront-end devices (100) may be implemented as logical devices. A logicaldevice may utilize the computing resources of any number of computingdevices to provide the functionality of the front-end devices (100)described throughout this application.

In one or more embodiments of the invention, the above-mentioned system(101) components may operatively connect to one another through anetwork (120) (e.g., a local area network (LAN), a wide area network(WAN), a mobile network, a wireless LAN (WLAN), etc.). In one or moreembodiments, the network (120) may be implemented using any combinationof wired and/or wireless connections. The network (120) may encompassvarious interconnected, network-enabled subcomponents (not shown) (e.g.,switches, routers, gateways, etc.) that may facilitate communicationsbetween the above-mentioned system (101) components.

In one or more embodiments of the invention, the network-enabledsubcomponents may be capable of: (i) performing one or morecommunication schemes (e.g., Internet protocol communications, Ethernetcommunications, etc.); (ii) being configured by the computing devices inthe network (120); and (iii) limiting communication(s) on a granularlevel (e.g., on a per-port level, on a per-sending device level, etc.).

In one or more embodiments of the invention, while communicating withthe back-end devices (130) remotely over the network (120), thefront-end devices (100) may receive, generate, process, store, and/ortransmit data structures (e.g., lists, tables, etc.). In one or moreembodiments, the data structures may have a predetermined format inaccordance with a communication protocol (e.g., a transmission controlprotocol (TCP), a user datagram protocol (UDP), etc.) implemented by thefront-end devices (100), the network-enabled subcomponents, the network(120), and/or the back-end devices (130).

In one or more embodiments of the invention, while providing differenttypes of computer-implemented services, the front-end devices (100) maycommunicate with the back-end devices (130) using different ports (e.g.,file transfer protocol (FTP) port 20, network time protocol (NTP) port123, etc.). In this manner, different functionalities of thecomputer-implemented services provided by the front-end devices (100)may be dependent on being able to communicate with the back-end devices(130) via different ports. If such communications are made inoperable,the front-end devices (100) may then be prevented from providingfunctionalities of the computer-implemented services corresponding tothe respective ports.

In one or more embodiments of the invention, the back-end devices (130)may provide remote computer-implemented services to the front-enddevices (100). In one or more embodiments, the front-end devices (100)may access the remote computer-implemented services via the VDIenvironment hosted by the back-end devices (130).

In one or more embodiments of the invention, the VDI environment is adesktop virtualization technology that runs one or more desktopoperation systems (OSs) (e.g., an environment through which a usercontrols a computing device (e.g., 600, FIG. 6 )) in a data center. Inone or more embodiments, the users of the front-end devices (100) mayaccess an image of the desktop OS (e.g., a virtual desktop (VD) image)remotely over the network (120). In this manner, the users may interactwith the desktop OS (including its applications) as if it was runninglocally in the front-end devices (100).

In one or more embodiments of the invention, the VD image may refer to apreconfigured image of the desktop OS in which the desktop environmentis separated from the computing device (e.g., 600, FIG. 6 ) used toaccess it. In one or more embodiments, the accessibility of the users tothe VD image (also referred to herein simply as “VD”) may depend on aconfiguration set by the administrators.

In one or more embodiments of the invention, for example, the users maybe automatically directed to a login screen of the VD when they areconnected to the VDI environment over the network (120). In thisscenario, the VD may only be allocated to a specific user (via a VDidentifier (not shown)). As another example, the users may need toselect a VD from a plurality of VDs (e.g., a VD pool) to launch whenthey are connected to the VDI environment over the network (120). Inthis scenario, the users may have access to all of the VDs in the VDpool.

In one or more embodiments of the invention, once the login screen of aVD is displayed, a user accessing the VD may enter the user’scredentials (e.g., username, password, etc.) on the login screen. Thelogin screen may be a graphical user interface (GUI) generated by avisualization module (not shown) of the back-end devices (130). In oneor more embodiments, the visualization module may be implemented inhardware (i.e., circuitry), software, or any combination thereof.

In one or more embodiments of the invention, once the user has loggedinto the VD, the user may select one or more applications (e.g.,computer programs) and/or may start performing one or more operations(e.g., functions, tasks, activities, etc.) available on the VD. Examplesof the applications may include, but are not limited to: a wordprocessor, a media player, a web browser, a file viewer, an imageeditor, etc.

In one or more embodiments of the invention, the applications installedon the VDs may include functionality to request and use each VD’scomputing resources (e.g., memory, CPU, storage, network bandwidth (BW),graphics processing unit (GPU), DPU, etc.). Additional details regardingthe computing resources of the VDs utilized by the applications aredescribed below in reference to FIG. 2 .

In one or more embodiments of the invention, to be able to provide thecomputer-implemented services of the VDI environment to the users, theback-end devices (130) may need to install a corresponding front-enddevice application (e.g., Remote Desktop Protocol (RDP) application,Enlighted Data Transport (EDT) application, etc.) or may need to run ahypertext markup language (e.g., HTML version 5) based session toinitiate a remote display protocol. The remote display protocol mayinclude, but it is not limited to: Independent Computing Architecture(ICA), EDT, Blast, Personal Computer over Internet Protocol (PCoIP),RDP, etc.

In one or more embodiments of the invention, the remote display protocolmay control the front-end devices’ (100) multimedia capabilities (e.g.,display, audio, video, etc.) via a multimedia engine (not shown). Forexample, the multimedia engine may control a display of a front-enddevice (100A, 100N) such that a status of an application running on theVD may be displayed in real-time to a user of the front-end device(100A, 100N). The status may be displayed in any visual format thatwould allow the user to easily comprehend (e.g., read and parse) thelisted information.

In one or more embodiments of the invention, the multimedia engine isoperatively connected to the front-end device (100A, 100N). Themultimedia engine may be implemented using hardware, software, or anycombination thereof.

In one or more embodiments of the invention, the remote display protocoldiscussed above may compress data that is transmitted to and from thefront-end device (100A, 100N) for a better user experience (e.g.,reduced latency). For example, the user of the front-end device (100A,100N) may operate on a spreadsheet (e.g., an application where data maybe analyzed and stored in tabular form). In this scenario, the front-enddevice (100A, 100N) may transmit user input (e.g., mouse movements,keystrokes, etc.) to the VD and bitmaps (e.g., a format to storecomputing device-independent and application-independent images) may betransmitted in response back to the front-end device (100A, 100N). Morespecifically, the data itself may not be populated to the user display;instead, the user display may show bitmaps that represent the data. Whenthe user enters additional data into the spreadsheet, the front-enddevice (100A, 100N) may only transmit the updated bitmaps.

In one or more embodiments of the invention, to perform the examplediscussed above, the remote display protocol may use a robust headercompression (ROHC) method to compress a header of a communicationprotocol. For example, the ROHC method may compress 40 bytes of IPversion 4 (IPv4) header into 1 byte. In one or more embodiments, theheader of the communication protocol may include, but it is not limitedto: a sequence number of a data packet (e.g., a small amount of datatransmitted over the network (120)), information related to destinationIP, a plurality of pointers to verify the status of the transmitted datapacket, etc.

Additionally, to perform the example discussed above, the remote displayprotocol may use a RDP data compression algorithm to compress the dataitself (e.g., payload). For example, in a 2:1 ratio, the RDP datacompression algorithm may compress 2MB data into 1MB.

In one or more embodiments of the invention, to be able to provide theremote computer-implemented services to the front-end devices (100), theback-end devices (130) may host virtual machines (VMs) (not shown) torun the VDI environment. In one or more embodiments, the VMs may belogical entities executed using computing resources of the back-enddevices (130) or using computing resources of other computing devices(e.g., servers, distributed computing systems, cloud resources, etc.)connected to the backend devices (130). Each of the VMs may beperforming similar or different processes.

In one or more embodiments of the invention, the VMs (and applicationshosted by the VMs) may generate data (e.g., VM data) that is stored inpersistent storage (not shown). In one or more embodiments, the VM datamay reflect the state of a VM.

In one or more embodiments of the invention, in addition to running theVDI environment, the VMs may provide services to the front-end devices(100). For example, the VMs may host instances of databases, emailservers, or other applications that are accessible to the front-enddevices (100). The VMs may host other types of applications not listedabove without departing from the scope of the invention. Moreover, theapplications hosted by the VMs may provide application services to thefront-end devices (100).

In one or more embodiments of the invention, the VMs may be implementedas computer instructions, e.g., computer code, stored on the persistentstorage that when executed by a processor of the back-end devices (130)cause the back-end devices (130) to provide the functionality of the VMsdescribed throughout this application.

In one or more embodiments of the invention, a hypervisor (not shown)may be configured to orchestrate the operation of the VMs by allocatingcomputing resources of the back-end devices (130) to each of the VMs.

In one or more embodiments of the invention, the hypervisor may be aphysical device including circuitry. The physical device may be, but itis not limited to: a field-programmable gate array (FPGA), anapplication-specific integrated circuit, a programmable processor, amicrocontroller, a digital signal processor, etc. The physical devicemay be adapted to provide the functionality of the hypervisor describedthroughout this application.

Alternatively, in one or more embodiments of the invention, thehypervisor may be implemented as computer instructions, e.g., computercode, stored on the persistent storage that when executed by a processorof the back-end devices (130) cause the back-end devices (130) toprovide the functionality of the VMs described throughout thisapplication.

In one or more embodiments of the invention, the back-end devices (130)may be implemented as computing devices (e.g., 600, FIG. 6 ). Acomputing device may be, for example, a mobile phone, a tablet computer,a laptop computer, a desktop computer, a server, a distributed computingsystem, or a cloud resource. The computing device may include one ormore processors, memory (e.g., RAM), and persistent storage (e.g., diskdrives, SSDs, etc.). The computing device may include instructions,stored on the persistent storage, that when executed by the processor(s)of the computing device cause the computing device to perform thefunctionality of the back-end devices (130) described throughout thisapplication.

Alternatively, in one or more embodiments of the invention, similar tothe front-end devices (100), the back-end devices (130) may also beimplemented as logical devices, as discussed above.

In one or more embodiments of the invention, the service manager (140)may manage the remote computer-implemented services provided by theback-end devices (130) by: (i) identifying the services the back-enddevices (130) are to provide (e.g., based on the number of users usingthe front-end devices (100)); (ii) identifying the ports that theback-end devices (130) utilize to provide the remotecomputer-implemented services; and (iii) limiting network communicationswithin the system (101) to only allow the back-end devices (130) tocommunicate using the ports corresponding to the identified services.

To limit the network communications within the system (101), the servicemanager (140) may configure the network-enabled subcomponents. Forexample, the service manager (140) may: (i) disable some of the ports ofthe network-enabled subcomponents; (ii) enable other ports of thenetwork-enabled subcomponents (e.g., those ports corresponding to theremote computer-implemented services provided by the back-end devices(130)); and (iii) reduce the communications BW afforded to the back-enddevices (130).

Therefore, while the back-end devices (130) may be capable of performingany number of remote computer-implemented services, they may be limitedin providing any number of the remote computer-implemented services overthe network (120). For example, the network-enabled subcomponents mayprevent the back-end devices (130) from communicating with the front-enddevices (100) using certain ports (which are required to provide theremote computer-implemented services that are being limited).

In one or more embodiments of the invention, by configuring thenetwork-enabled subcomponents, the remote computer-implemented servicesprovided to the users of the front-end devices (100) may be granularlyconfigured without modifying the operation(s) of the front-end devices(100).

In one or more embodiments of the invention, the service manager (140)may allocate the computing resources (e.g., memory, CPU, storage,network BW, GPU, DPU, etc.) available in the VDI environment to generatea plurality of VD pools. In one or more embodiments, after the pluralityof VD pools are generated, the service manager (140) may migrate (whennecessary) one or more VDs to another VD pool. For example, when one ofthe VDs requires access to more of one type of computing resource (e.g.,access to more DPUs), the service manager (140) may migrate that VD to aVD pool with a higher DPU availability. In another example, when one ofthe VDs requires less access to DPUs, the service manager (140) maymigrate that VD to a VD pool with a lower DPU availability. Additionaldetails regarding VD migration are described below in reference to FIG.4.2 .

In one or more embodiments of the invention, the service manager (140)may be implemented as a computing device (e.g., 600, FIG. 6 ). Thecomputing device may be, for example, a mobile phone, a tablet computer,a laptop computer, a desktop computer, a server, a distributed computingsystem, or a cloud resource. The computing device may include one ormore processors, memory (e.g., RAM), and persistent storage (e.g., diskdrives, SSDs, etc.). The computing device may include instructions,stored on the persistent storage, that when executed by the processor(s)of the computing device cause the computing device to perform thefunctionality of the service manager (140) described throughout thisapplication.

Alternatively, in one or more embodiments of the invention, similar tothe front-end devices (100), the service manager (140) may also beimplemented as a logical device, as discussed above.

Turning now to FIG. 2 , FIG. 2 shows a diagram of a back-end device inaccordance with one or more embodiments of the invention. The back-enddevice (200) may be any one of the back-end devices (130) discussedabove in reference to FIG. 1 . The back-end device (200) may include aplurality of VD pools (210) and resources (220) (also referred to aboveas the “computing resources”). The back-end device (200) may includeadditional, fewer, and/or different components without departing fromthe scope of the invention. Each component may be operably connected toany of the other component via any combination of wired and/or wirelessconnections. Each component illustrated in FIG. 2 is discussed below.

In one or more embodiments of the invention, each VD pool (VD pool A(210A), VD pool N (210N)) may include a plurality of VDs (e.g., VD A, VDB, VD C, etc.) (not shown in FIG. 2 for the sake of brevity). In one ormore embodiments, a VD pool configuration for each of the plurality ofVD pools may be generated using a common configuration template set (notshown) and a plurality of resource specific pool specific configurationtemplates (not shown). Additional details regarding the commonconfiguration template set, the plurality of resource specific poolspecific configuration templates, and the generation of each VD poolconfiguration are described below in reference to FIGS. 3 and 4.1 ,respectively.

In one or more embodiments of the invention, the resources (220) mayinclude, but they are not limited to: memory (220A), a CPU (220B),storage (220C), network (220D), a GPU (220E), a DPU (220F), etc. Each ofthese resources is described below.

In one or more embodiments of the invention, the memory (220A) may beany hardware component that is used to store data in a computing device(e.g., 600, FIG. 6 ). The data stored in the memory (220A) may beaccessed almost instantly (e.g., in milliseconds) regardless of wherethe data is stored in the memory (220A). In one or more embodiments, thememory (220A) may provide the above-mentioned instant data accessbecause the memory (220A) may be directly connected to the CPU (220B) ona wide and fast bus (e.g., a high-speed internal connection thattransfers data between the hardware components of a computing device).

In one or more embodiments of the invention, the memory (220A) may be(or may include), but it is not limited to: DRAM (e.g., double data rate4 (DDR4) DRAM, error correcting code (ECC) DRAM, etc.), PMEM, Flashmemory, etc. In these memory types, the DRAM may be volatile in which itmay only store data as long as it is being supplied with power.Additionally, the PMEM and Flash memory may be non-volatile in whichthey may store data even after a power supply is removed.

In one or more embodiments of the invention, the CPU (220B) may refer toan electronic circuity that may execute operations specified by anapplication. More specifically, the CPU (220B) may perform an operationin three steps: (i) fetching instructions related to the operation fromthe memory (220A); (ii) analyzing the fetched instructions; (iii)performing the operation based on the analysis. In one or moreembodiments, the operation may include, but it is not limited to: basicarithmetic calculations, comparing numbers, performing a function,displaying a video, etc.

In one or more embodiments of the invention, the CPU (220B) may include,but it is not limited to: 10-core (e.g., an individual processor withinthe CPU) (with 3.7 GHz clock speed), two channels DDR4 DRAM support,etc. In one or more embodiments, the clock speed may refer to the numberof instructions that the CPU (220B) is able to handle per second.

In one or more embodiments of the invention, as a central processingvirtualization platform, a virtual CPU (vCPU) application may beprovided to the plurality of VDs. In one or more embodiments, the vCPUapplication may enable the plurality of VDs to have direct access to asingle physical CPU. More specifically, the vCPU application may providecomputing capabilities by sharing the single physical CPU among theplurality of VDs.

In one or more embodiments of the invention, the storage (220C) mayrefer to a hardware component that is used to store data in a computingdevice (e.g., 600, FIG. 6 ). In one or more embodiments, the storage(220C) may be a physical computer readable medium. For example, thestorage (220C) may be (or may include) hard disk drives (e.g., HDDs),Flash-based storage devices (e.g., SSDs), tape drives, fibre-channel(FC) based storage devices, or other physical storage media. The storagemay be other types of digital storage not listed above without departingfrom the scope of the invention.

In one or more embodiments of the invention, the storage (220C) may beconfigured as a storage array (e.g., a network attached storage array).In one or more embodiments, the storage array may refer to a collectionof one or more physical storage devices, in which various forms of datamay be consolidated. Each physical storage device may includenon-transitory computer readable storage media, in which the data may bestored in whole or in part, and temporarily or permanently.

In one or more embodiments of the invention, the SSDs may use anon-volatile memory express (NVMe) or NVME over Fabrics (NVMe-oF)interface protocols for accessing Flash storage via a peripheralcomponent interconnect express (PCI-e) bus. In one or more embodiments,in comparison to a SATA bus (e.g., 600 MB/s data transfer rate), thePCI-e bus may provide much higher data transfer rate (e.g., 64,000MB/s). In this manner, the NVMe may execute, for example, 65535 commandqueues simultaneously (with a depth of 65536 commands per queue),whereas a SATA-based SSD may execute only one command queue (with adepth of 32 commands per queue).

In one or more embodiments of the invention, the NVMe-oF interfaceprotocol may extend capabilities of the NVMe interface protocol over aremote direct memory access (RDMA) or Fibre Channel fabric (e.g., ahigh-speed networking technology to transfer data among data centers,computer servers, etc.). In this manner, the capabilities of the NVMeinterface protocol may be provided over distances.

In one or more embodiments of the invention, the network (220D) mayrefer to a computer network including two or more computers that areconnected any combination of wired and/or wireless connections. In oneor more embodiments, the computer network may be generated usinghardware components (e.g., routers, access points, cables, switches,etc.) and software components (e.g., OSs, business applications, etc.).

In one or more embodiments of the invention, geographic location maydefine the computer network. For example, a LAN may connect computingdevices in a defined physical space (e.g., office building), whereas aWAN (e.g., Internet) may connect computing devices across continents. Inone or more embodiments, the computer network may be defined based onnetwork protocols (e.g., TCP, UDP, IPv4, etc.). Details regarding thenetwork protocols are described above in reference to FIG. 1 .

In one or more embodiments of the invention, the quality ofcommunication over a computer network may be determined by measuring thecomputer network’s quality of service (QoS). In one or more embodiments,the QoS may include one or more hardware and/or software components toguarantee the computer network’s ability to run high-priorityapplications under limited network capacity. The hardware and/orsoftware components operating on the computer network may accomplishthis by providing differentiated handling (e.g., a networkingarchitecture to classify and manage QoS on computer networks) andcapacity allocation. In one or more embodiments, parameters that can beused to measure the QoS may include, but they are not limited to: BW,delay, jitter, error rate, etc.

In one or more embodiments of the invention, the GPU (220E) may refer toan electronic circuitry that may provide parallel data processingcapabilities to generate enhanced, real-time graphics and to performaccelerated computing tasks (which is particularly useful for machinelearning (ML) operations).

In one or more embodiments of the invention, the GPU (220E) may include,but it is not limited to: graphics memory controller, video processingengine, graphics and computation engine, etc.

In one or more embodiments of the invention, as a graphicsvirtualization platform, virtual GPU (vGPU) application may be providedto the plurality of VDs. In one or more embodiments, the vGPUapplication may enable the plurality of VDs to have direct access to asingle physical GPU. More specifically, the vGPU application may providethe parallel data processing and accelerated computing capabilities bysharing the single physical GPU among the plurality of VDs.

In one or more embodiments of the invention, breadth-first anddepth-first GPU allocation policies may be utilized for vGPU-enabledVDs. In one or more embodiments, each hypervisor (discussed above inreference to FIG. 1 ) may use the breadth-first or the depth-first GPUallocation policy by default. Each of these GPU allocation policies isdescribed below.

In one or more embodiments of the invention, the breadth-first GPUallocation policy may reduce the number of vGPUs running on a pluralityof physical GPUs. For example, newly generated vGPUs may be placed on aphysical GPU that has the fewest vGPUs already resident on it. In one ormore embodiments, the breadth-first GPU allocation policy may providehigher performance because it reduces sharing of the plurality ofphysical GPUs.

In one or more embodiments of the invention, the depth-first GPUallocation policy may increase the number of vGPUs running on theplurality of physical GPUs. For example, the newly generated vGPUs maybe placed on a physical GPU that has the most vGPUs already resident onit. In one or more embodiments, the depth-first GPU allocation policymay provide higher density of vGPUs, particularly when different typesof vGPUs are being run. However, the depth-first GPU allocation policymay also provide lower performance because it may maximize sharing ofthe plurality of physical GPUs.

In one or more embodiments of the invention, the DPU (220F) may refer toan electronic circuitry that may perform accelerated data processing andoptimized data movement data within a data center. In one or moreembodiments, the DPU may include, but it is not limited to: high-speednetworking interface (e.g., 200 gigabits per second (200Gbps)), DRAM,multi-core (e.g., 8-core) CPU, programmable acceleration engines(particularly for ML, security, and telecommunications purposes), etc.

In one or more embodiments of the invention, as a data processingvirtualization platform, a virtual DPU (vDPU) application may beprovided to the plurality of VDs. In one or more embodiments, the vDPUapplication may enable the plurality of VDs to have direct access to asingle physical DPU. More specifically, the vDPU application may providefull data center-on-chip programmability, high performance networkingand computing capabilities by sharing the single physical DPU among theplurality of VDs.

Turning now to FIG. 3 , FIG. 3 shows a relationship diagram illustratinghow computing resources are utilized by a VD pool in accordance with oneor more embodiments of the invention. In one or more embodiments of theinvention, as discussed above in reference to FIG. 2 , a VD poolconfiguration of a VD pool (300) may be generated using: a commonconfiguration template set (302) (e.g., a level one (L1) template); anda resource specific pool specific configuration template (304) (e.g., alevel two (L2) template).

In one or more embodiments of the invention, the L1 template maycorrespond to one or more common templates that are used to configure aset of computing resources across the plurality of VD pools. In one ormore embodiments, the VDs residing in different VD pools (e.g., 210A,210N, FIG. 2 ) may have common-attributes (e.g., the same VD image, CPUassignment, and GPU assignment, etc.). However, in certain circumstancesbased on users’ requirements and/or technical considerations of a datacenter, there may be a requirement for some of the configurationparameters to be fixed and some of the configuration parameters to bevariable (e.g., tiered). As a result, some of the configurationparameters may need to be redefined by combining L1 and L2 templates.

In one or more embodiments of the invention, the L2 template maycorrespond to a template that includes a specific configuration for atarget resource (e.g., DPU). This specific configuration may only beused for a particular (e.g., specific) VD pool. For example, consider ascenario in which there are six types of resources (discussed above inreference to FIG. 2 ) and the administrators wish to optimize a DPUconfiguration of three particular VD pools (e.g., VD pool B, VD pool C,and VD pool D) because these three VD pools may be DPU specific VDpools. In this scenario, there may be only a single L1 template thatincludes configurations for each of the resource types (e.g., a singleL1 template that includes configuration parameters for memory, CPU,storage, network, GPU, and DPU). Said another way, only a single L1template may be retrieved and used for all of the VD pools.Alternatively, there may be six separate L1 templates that each includesonly a single configuration directed to one of the resource types (e.g.,one L1 template for memory, one L1 template for CPU, one L1 template forstorage, one L1 template for network, one L1 template for GPU, and oneL1 template for DPU).

In one or more embodiments, in order to optimize the DPU configurationof each of the three DPU specific VD pools, the administrators may needto generate three L2 templates for the DPU resource (e.g., a 1 GB/s (1Gigabyte per second) BW (bandwidth) vDPU with 1 GB vDPU frame buffer L2template, a 2GB/s BW vDPU with 1GB vDPU frame buffer L2 template, and a4GB/s BW vDPU with 1GB vDPU frame buffer L2 template). For each of thethree DPU specific VD pools, the administrators may then need to combinethe L1 template(s) with each of the respective L2 templates to fullydefine the configuration of each of the three DPU specific VD pools.

In the above-discussed scenario, while the remaining VDs in the VDIenvironment continue sharing the common-attributes, each of the threeDPU specific VD pools may have the following configurations: (i) VD poolB: the L1 template and the 1 GB/s BW vDPU with 1 GB vDPU frame buffer L2template; (ii) VD pool C: the L1 template and the 2 GB/s BW vDPU with 1GB vDPU frame buffer L2 template; and (iii) VD pool D: the L1 templateand the 4 GB/s BW vDPU with 1 GB vDPU frame buffer L2 template.

FIGS. 4.1 and 4.2 show a method for managing a common-attribute VDIenvironment with tiered DPU capabilities in accordance with one or moreembodiments of the invention. While various steps in the method arepresented and described sequentially, those skilled in the art willappreciate that some or all of the steps may be executed in differentorders, may be combined or omitted, and some or all steps may beexecuted in parallel without departing from the scope of the invention.

Turning now to FIG. 4.1 , the method shown in FIG. 4.1 may be performedby, for example, the service manager (e.g., 140, FIG. 1 ). Othercomponents of the system (e.g., 101, FIG. 1 ) illustrated in FIG. 1 mayalso contribute to the performance of the method shown in FIG. 4.1without departing from the scope of the invention.

In Step 400, a target resource is selected. For example, in one or moreembodiments of the invention, the selected target resource may be DPU.

In Step 402, a number of VD pools to be created is determined.Continuing with the scenario discussed above in reference to FIG. 3 ,the administrators may determine how many VD pools (e.g., three) need tobe created. Alternatively, the number of VD pools that needs to becreated may be predetermined based on technical considerations of a datacenter.

In Step 404, a target resource specific pool specific configurationtemplate for each of the VD pools is obtained. In one or moreembodiments of the invention, the target L2 template for each of the VDpools may be obtained from, for example: the front-end devices (e.g.,100, FIG. 1 ), persistent storage, target L2 template vendor(s), etc.

In one or more embodiments of the invention, prior to obtaining thetarget L2 template, the target L2 template for each of the VD pools maybe generated. More specifically, one or more target L2 templates (e.g.,a 1 GB/s BW vDPU with 1GB vDPU frame buffer template, a 2 GB/s BW vDPUwith 1GB vDPU frame buffer template, a 4 GB/s BW vDPU with 1 GB vDPUframe buffer template, etc.) may be generated by administrators based onusers’ preferences. The users’ preferences may include, but they are notlimited to: the names of applications that need to be run, the type ofthe VD image, the version of the VD image, etc.

Alternatively, in one or more embodiments of the invention, the targetL2 templates may be generated without considering the users’preferences. For example, even though a user has requested thegeneration of a 1 GB/s BW vDPU with 1 GB vDPU frame buffer L2 template,a 2 GB/s BW vDPU with 1 GB vDPU frame buffer L2 template, and a 4 GB/sBW vDPU with 1 GB vDPU frame buffer L2 template, the user may not beprovided with these requested L2 templates because 1 GB vDPU framebuffer may not be an available resource. For example, the user may beprovided with previously generated target L2 templates such as a 1 GB/sBW vDPU with 512 MB vDPU frame buffer L2 template, a 2 GB/s BW vDPU with512 MB vDPU frame buffer L2 template, and a 4 GB/s BW vDPU with 512 MBvDPU frame buffer L2 template that are generated based on currentlyavailable target resources (e.g., 512 MB vDPU frame buffer) among theresources (e.g., 220, FIG. 2 ).

In Step 406, a common configuration template set (e.g., the L1 template)is obtained. Details regarding the L1 template are described above inreference to FIG. 3 .

In Step 408, a target L2 template from the plurality of target L2templates is selected. Continuing with the scenario discussed above inreference to FIG. 3 , the 2 GB/s BW vDPU with 1 GB vDPU frame buffer L2template may be selected from among the 1 GB/s BW vDPU with 1 GB vDPUframe buffer L2 template, the 2 GB/s BW vDPU with 1 GB vDPU frame bufferL2 template, and the 4 GB/s BW vDPU with 1 GB vDPU frame buffer L2template.

In Step 410, a VD pool configuration for each of the plurality of VDpools is generated using the plurality of target L2 templates and the L1template. In one or more embodiments of the invention, a VD poolconfiguration of a specific VD pool may be generated using the selectedtarget L2 template for that specific VD pool and the L1 template (thatis common for all the VD pools). For example, a VD pool configurationmay be generated for VD pool C using the L1 template and the 2 GB/s BWvDPU with 1 GB vDPU frame buffer L2 template (e.g., the selected targetL2 template for VD pool C as discussed above via the example presentedin FIG. 3 ).

In Step 412, a first determination is performed to determine whetheradditional VD pool configurations are required. If the result of thefirst determination is YES, the method returns to Step 408. If theresult of the first determination is NO, the method proceeds to Step414. For example, referring to the example scenario discussed above inreference to FIG. 3 , if the result of the first determination in Step412 shows that only one VD pool configuration is generated (for VD poolB) while VD pool configurations have not yet been generated for VD poolsC and D, the method may return to Step 408 to generate the remaining twoVD pool configurations (for VD pool C and VD pool D).

In Step 414, a default VD pool is selected from the plurality of VDpools. Continuing with the scenario discussed above in reference toFIGS. 3, VD pool C may be selected as the default pool. In one or moreembodiments of the invention, the default VD pool may be selected basedon users’ preferences or may be selected automatically (based onprevious users’ target resource utilization information). For example,based on the users’ preferences for selecting VD pools with moreresource availability, VD pool D may be selected because VD pool D has ahigher target resource availability. As another example, based on theprevious users’ target resource usage data, VD pool C may be selectedbecause VD pool C has an average target resource availability.

In one or more embodiments, the previous users’ target resource usagedata may be based on data collected about VD pools that are assigned tousers previously (by the administrators). For example, during the lastfour default VD pool assignment processes, VD pool D was assigned to oneuser and VD pool C was assigned to three other users. For this reason,the previous users’ target usage data indicates that the next usershould be assigned to VD pool C.

In Step 416, a plurality of VDs are deployed into the default VD poolusing a VD pool configuration associated with the default VD pool.Continuing with the example discussed above in FIG. 3 , the plurality ofVDs may be deployed into VD pool C because of the VD pool configurationof VD pool C (e.g., the configurations included in the L1 template andthe 2 GB/s BW vDPU with 1 GB vDPU frame buffer L2 template of VD poolC). Additional details regarding the deployment of VDs into a default VDpool are described below in reference to FIGS. 5.1-5.3 .

In one or more embodiments of the invention, the method may endfollowing Step 416.

Turning now to FIG. 4.2 , the method shown in FIG. 4.2 may be performedby, for example, the service manager (e.g., 140, FIG. 1 ). Othercomponents of the system (e.g., 101, FIG. 1 ) illustrated in FIG. 1 mayalso contribute to the performance of the method shown in FIG. 4.2without departing from the scope of the invention.

In Step 418, each of the VDs is monitored to obtain target resourceutilization information. In one or more embodiments of the invention, inthe event that the target resource is DPU, the target resourceutilization information may be DPU utilization information of each ofthe VDs. Continuing with the scenario discussed above in reference toFIG. 3 and assuming that the target resource is DPU, the target resourceutilization information may indicate, for example, “VD A’s current DPUutilization is 10%”, “VD B’s current DPU utilization is 90%”, and “VDE’s current DPU utilization is 8%”.

In Step 420, target resource telemetry is obtained. In one or moreembodiments of the invention, the target resource telemetry may refer toadditional DPU utilization information that is obtained from othercomponents of the system (e.g., 101, FIG. 1 ) illustrated in FIG. 1 .For example, the service manager may obtain the additional DPUutilization information from the persistent storage and/or one or morehyperconverged nodes (not shown). In one or more embodiments, ahyperconverged node may refer to a unified data center that combinesstorage, computation, networking, and management functionalities of adata center in a single hardware component.

More specifically, in one or more embodiments of the invention, theservice manager may obtain the additional DPU utilization informationthrough an application programming interface (API) call. In one or moreembodiments, the API may represent a collection of methods andprocedures (e.g., retrieving information about an API source, updatingthe API source, etc.) that may be executed by other applications in acomputing system (e.g., 600, FIG. 6 ). The collection of methods andprocedures may be designed and configured to facilitate the servicemanager’s access for DPU utilization check and manipulation of locallystored DPU utilization information. For example, the service manager maymake an API call to each of the other components of the system (e.g.,101, FIG. 1 ) illustrated in FIG. 1 . In return, that API call mayobtain the target resource telemetry. Other resource monitoring methodsmay also be taken into account without departing from the scope of theinvention.

In Step 422, a second determination is performed based on the targetresource utilization information (including the information obtained inStep 418 and Step 420) to determine whether at least one migrationcriterion of the default VD pool is satisfied. If the result of thesecond determination is YES, the method proceeds to Step 424. If theresult of the second determination is NO, the method returns to Step418.

In one or more embodiments of the invention, the second determinationmay be performed by comparing the target resource utilizationinformation of each of the VDs with the migration criterion of thedefault VD pool. Additional details regarding the migration criterion ofthe default VD pool are described below in reference to Step 426 andStep 428.

In one or more embodiments of the invention, the migration criterion mayinclude a network BW threshold value. In one or more embodiments, thenetwork BW may refer to the maximum allowed amount of data transmittedover the network in a given amount of time (e.g., the BW of thenetwork). For example, a 1 GB/s BW vDPU with 1 GB vDPU frame buffer VDpool may have a 1 GB/s BW and a 2 GB/s BW vDPU with 1 GB vDPU framebuffer VD pool may have a 2 GB/s BW. As another example, if one of theVDs deployed into the default VD pool (e.g., a 2 GB/s BW vDPU with 1 GBvDPU frame buffer VD pool) needs a higher amount of network BW becauseof higher DPU utilization, this VD may need to be migrated to a VD pool(e.g., a 4 GB/s BW vDPU with 1 GB vDPU frame buffer VD pool) that has amuch higher amount of network BW.

In one or more embodiments of the invention, the service manager maystore the migration criterion locally in persistent storage or mayobtain the one migration criterion from the administrators.

In Step 424, a third determination is performed to determine whetheradditional DPU is required. If the result of the third determination isYES, the method proceeds to Step 426. If the result of the thirddetermination is NO, the method proceeds to Step 428.

In Step 426, one or more VDs that require more DPU access are migratedto a VD pool with a higher (e.g., larger) DPU availability. Continuingwith the example discussed in Step 418, VD B may need to be migratedfrom a 2 GB/s BW vDPU with 1 GB vDPU frame buffer VD pool to a 4 GB/s BWvDPU with 1 GB vDPU frame buffer VD pool because the current DPUutilization of VD B (e.g., 90%) is above one of the migration criteria(e.g., migration required if the current DPU utilization is above 60%)of the 2 GB/s BW vDPU with 1 GB vDPU frame buffer VD pool. Additionaldetails regarding this example migration are described in reference toFIGS. 5.1-5.3 .

In one or more embodiments of the invention, the method may endfollowing Step 426.

In Step 428, one or more VDs that require less DPU access are migratedto a VD pool with a lower (e.g., smaller) DPU availability. Continuingwith the example discussed in Step 418, VD A may need to be migratedfrom a 2 GB/s BW vDPU with 1 GB vDPU frame buffer VD pool to a 1 GB/s BWvDPU with 1 GB vDPU frame buffer VD pool because the current DPUutilization of VD A (e.g., 10%) is below one of the migration criteria(e.g., migration required if the current DPU utilization is below 15%)of the 2 GB/s BW vDPU with 1 GB vDPU frame buffer VD pool. Additionaldetails regarding this example migration are described in reference toFIGS. 5.1-5.3 .

In one or more embodiments of the invention, the method may endfollowing Step 428.

Start of Example

The following section describes an example of one or more embodiments.The example, illustrated in FIGS. 5.1-5.3 , is not intended to limit thescope of the embodiments disclosed herein and is independent from anyother examples discussed in this application.

Turning to the example, consider a scenario in which five VDs (e.g., VDA (502A), VD B (502B), VD C (502C), VD D (502D), and VD E (502E)) aredeployed into VD pool C (504) (e.g., a default VD pool). Initially, FIG.5.1 shows a diagram of an example of a back-end device (500). For thesake of brevity, not all components of the example back-end device maybe illustrated in FIG. 5.1 .

Based on the VD pool configuration of VD pool C, the five VDs aredeployed into VD pool C (504). More specifically, assume here that theVD pool configuration of VD pool C (504) includes: (i) 8 GB DRAM; (ii) 2vCPUs with 4 cores; (iii) 80 GB SSD storage; (iv) 10 GB/s BW with 5 mslatency QoS; (v) depth-first vGPU with 2 GB vGPU frame buffer; and (vi)2 GB/s BW vDPU with 1 GB vDPU frame buffer.

At this time, all five VDs are directed (e.g., instructed) to use theabove-mentioned resources of VD pool C (504) to provide services to theusers and/or administrators.

Turning now to FIG. 5.2 , FIG. 5.2 shows a diagram of the exampleback-end device (500) at a later point in time. The service manager (notshown) starts monitoring each VD in VD pool C (504). After obtainingeach VD’s DPU utilization information, the service manager makes a firstdetermination that at least one migration criterion of VD pool C (504)is satisfied. More specifically, the service manager has determined thatVD A (502A)′s current DPU utilization of 10% is below VD pool C’s (504)lower DPU utilization threshold of 15%. Following this firstdetermination, the service manager makes a second determination thatanother migration criterion of VD pool C (504) is satisfied. Morespecifically, VD B (502B)′s current DPU utilization of 90% has exceededVD pool C’s (504) upper DPU utilization threshold of 60%. Following thissecond determination, the service manager makes a third determinationthat another migration criterion of VD pool C (504) is satisfied. Morespecifically, VD E (502E)′s current DPU utilization of 4% has stayedunder VD pool C’s (504) lower DPU utilization threshold of 15%.

Based on the above three determinations made by the service manager, theservice manager decides to migrate VD A (502A) and VD E (502E) to VDpool B (502). This is because the VD pool configuration of VD pool B(502) includes: (i) 8 GB DRAM; (ii) 2 vCPUs with 4 cores; (iii) 80 GBSSD storage; (iv) 10 GB/s BW with 5 ms latency QoS; (v) depth-first vGPUwith 2 GB vGPU frame buffer; and (vi) 1 GB/s BW vDPU with 1 GB vDPUframe buffer. The service manager also decides to migrate VD B (502B) toVD pool D (506) because the VD pool configuration of VD pool D (506)includes: (i) 8 GB DRAM; (ii) 2 vCPUs with 4 cores; (iii) 80 GB SSDstorage; (iv) 10 GB/s BW with 5 ms latency QoS; (v) depth-first vGPUwith 2 GB vGPU frame buffer; and (vi) 4 GB/s BW vDPU with 1 GB vDPUframe buffer.

At this time, all five VDs are still directed to use the same resourcesto provide services to the users and/or administrators.

Turning now to FIG. 5.3 , FIG. 5.3 shows a diagram of the exampleback-end device (500) at yet a later point in time. Following thedecisions made by the service manager, VD A (502A) and VD E (502E) aremigrated to VD pool B (502), and VD B (502B) is migrated to VD pool D(506). At this time, in response to the VD pool migrations, VD A (502A)and VD E (502E) are now directed to use the resources of VD pool B(502), and VD B (502B) is now directed to use the resources of VD pool D(506). VD C (502C) and VD D (502D) are still directed to use the sameresources of VD pool C (504).

End of Example

Turning now to FIG. 6 , FIG. 6 shows a diagram of a computing device inaccordance with one or more embodiments of the invention.

In one or more embodiments of the invention, the computing device (600)may include one or more computer processors (602), non-persistentstorage (604) (e.g., volatile memory, such as RAM, cache memory),persistent storage (606) (e.g., a hard disk, an optical drive such as acompact disk (CD) drive or digital versatile disk (DVD) drive, a flashmemory, etc.), a communication interface (612) (e.g., Bluetoothinterface, infrared interface, network interface, optical interface,etc.), an input device(s) (610), an output device(s) (608), and numerousother elements (not shown) and functionalities. Each of these componentsis described below.

In one or more embodiments, the computer processor(s) (602) may be anintegrated circuit for processing instructions. For example, thecomputer processor(s) may be one or more cores or micro-cores of aprocessor. The computing device (600) may also include one or more inputdevices (610), such as a touchscreen, keyboard, mouse, microphone,touchpad, electronic pen, or any other type of input device. Further,the communication interface (612) may include an integrated circuit forconnecting the computing device (600) to a network (not shown) (e.g., aLAN, a WAN, such as the Internet, mobile network, or any other type ofnetwork) and/or to another device, such as another computing device.

In one or more embodiments, the computing device (600) may include oneor more output devices (608), such as a screen (e.g., a liquid crystaldisplay (LCD), plasma display, touchscreen, cathode ray tube (CRT)monitor, projector, or other display device), a printer, externalstorage, or any other output device. One or more of the output devicesmay be the same or different from the input device(s). The input andoutput device(s) may be locally or remotely connected to the computerprocessor(s) (602), non-persistent storage (604), and persistent storage(606). Many different types of computing devices exist, and theaforementioned input and output device(s) may take other forms.

The problems discussed throughout this application should be understoodas being examples of problems solved by embodiments described herein,and the various embodiments should not be limited to solving thesame/similar problems. The disclosed embodiments are broadly applicableto address a range of problems beyond those discussed herein.

While embodiments discussed herein have been described with respect to alimited number of embodiments, those skilled in the art, having thebenefit of this Detailed Description, will appreciate that otherembodiments can be devised which do not depart from the scope ofembodiments as disclosed herein. Accordingly, the scope of embodimentsdescribed herein should be limited only by the attached claims.

What is claimed is:
 1. A method for managing a virtual desktopinfrastructure (VDI) environment, the method comprising: obtaining aplurality of target resource specific pool specific configurationtemplates for a target resource, wherein each of the plurality of targetresource specific pool specific configuration templates is associatedwith one of a plurality of virtual desktop (VD) pools, wherein thetarget resource is a data processing unit (DPU); obtaining a commonconfiguration template set; generating a VD pool configuration for eachof the plurality of VD pools using the plurality of target resourcespecific pool specific configuration templates and the commonconfiguration template set to obtain a plurality of VD poolconfigurations; selecting a default VD pool from the plurality of VDpools; and deploying, based on the selection, a plurality of VDs intothe default VD pool using a VD pool configuration associated with thedefault VD pool.
 2. The method of claim 1, further comprising:monitoring each of the plurality of VDs to obtain target resourceutilization information; making a determination, based on the targetresource utilization information, that at least one migration criterionof the default VD pool is satisfied; making a second determination,based on the determination, that an additional target resourceavailability is required; and migrating, based on the seconddetermination, one or more VDs that require the additional targetresource availability to a VD pool of the plurality of VD pools withhigher target resource availability, wherein the VD pool is associatedwith a VD pool configuration of the plurality of VD pool configurationsand wherein the VD pool configuration is associated with a targetresource specific pool specific configuration template that is differentthat a target resource specific pool specific configuration templateassociated with the default VD pool.
 3. The method of claim 2, whereinmaking the determination that at least the one migration criterion ofthe default VD pool is satisfied comprises: comparing the targetresource utilization information of each of the plurality of VDs with atleast the one migration criterion of the default VD pool.
 4. The methodof claim 2, wherein at least the one migration criterion comprises anetwork bandwidth (BW) threshold value.
 5. The method of claim 1,wherein each of the plurality of target resource specific pool specificconfiguration templates comprises target resource related VD poolconfiguration parameters.
 6. The method of claim 1, wherein the DPU is atwo gigabytes per second bandwidth virtual DPU (vDPU) with a onegigabyte vDPU frame buffer.
 7. The method of claim 1, wherein the commonconfiguration template set comprises non-target resource related VD poolconfiguration parameters.
 8. A method for managing a virtual desktopinfrastructure (VDI), the method comprising: monitoring a plurality ofvirtual desktops (VDs) to obtain target resource utilizationinformation, wherein the plurality of VDs is deployed into a default VDpool from a plurality of VD pools, wherein a target resource is a dataprocessing unit (DPU) and wherein the target resource utilizationinformation is associated with the target resource; making adetermination, based on the target resource utilization information,that at least one migration criterion of the default VD pool issatisfied; making a second determination, based on the determination,that an additional target resource availability is required; andmigrating, based on the second determination, one or more VDs thatrequire the additional target resource availability to a VD pool of theplurality of VD pools with higher target resource availability, whereinthe VD pool is associated with a VD pool configuration of the pluralityof VD pool configurations and wherein the VD pool configuration isassociated with a target resource specific pool specific configurationtemplate that is different than a target resource specific pool specificconfiguration template associated with the default VD pool.
 9. Themethod of claim 8, further comprising: prior to monitoring the pluralityof VDs: obtaining a plurality of target resource specific pool specificconfiguration templates for the target resource, wherein each of theplurality of target resource specific pool specific configurationtemplates is associated with one of the plurality of VD pools; obtaininga common configuration template set; generating a VD pool configurationfor each of the plurality of VD pools using the plurality of targetresource specific pool specific configuration templates and the commonconfiguration template set to obtain a plurality of VD poolconfigurations; selecting the default VD pool from the plurality of VDpools; and deploying, based on the selection, the plurality of VDs intothe default VD pool using a VD pool configuration associated with thedefault VD pool.
 10. The method of claim 8, wherein making thedetermination that at least the one migration criterion of the defaultVD pool is satisfied comprises: comparing the target resourceutilization information of each of the plurality of VDs with at leastthe one migration criterion of the default VD pool.
 11. The method ofclaim 8, wherein at least the one migration criterion comprises anetwork bandwidth (BW) threshold value.
 12. The method of claim 9,wherein each of the plurality of target resource specific pool specificconfiguration templates comprises target resource related VD poolconfiguration parameters.
 13. The method of claim 8, wherein the DPU isa one gigabyte per second bandwidth virtual DPU (vDPU) with a onegigabyte vDPU frame buffer.
 14. The method of claim 9, wherein thecommon configuration template set comprises non-target resource relatedVD pool configuration parameters.
 15. A non-transitory computer readablemedium comprising computer readable program code, which when executed bya computer processor enables the computer processor to perform a methodfor managing a virtual desktop infrastructure (VDI) environment, themethod comprising: obtaining a plurality of target resource specificpool specific configuration templates for a target resource, whereineach of the plurality of target resource specific pool specificconfiguration templates is associated with one of a plurality of virtualdesktop (VD) pools, wherein the target resource is a data processingunit (DPU); obtaining a common configuration template set; generating aVD pool configuration for each of the plurality of VD pools using theplurality of target resource specific pool specific configurationtemplates and the common configuration template set to obtain aplurality of VD pool configurations; selecting a default VD pool fromthe plurality of VD pools; and deploying, based on the selection, aplurality of VDs into the default VD pool using a VD pool configurationassociated with the default VD pool.
 16. The non-transitory computerreadable medium of claim 15, further comprising: monitoring each of theplurality of VDs to obtain target resource utilization information;making a determination, based on the target resource utilizationinformation, that at least one migration criterion of the default VDpool is satisfied; making a second determination, based on thedetermination, that an additional target resource availability isrequired; and migrating, based on the second determination, one or moreVDs that require the additional target resource availability to a VDpool of the plurality of VD pools with higher target resourceavailability, wherein the VD pool is associated with a VD poolconfiguration of the plurality of VD pool configurations and wherein theVD pool configuration is associated with a target resource specific poolspecific configuration template that is different that a target resourcespecific pool specific configuration template associated with thedefault VD pool.
 17. The non-transitory computer readable medium of 16,wherein making the determination that at least the one migrationcriterion of the default VD pool is satisfied comprises: comparing thetarget resource utilization information of each of the plurality of VDswith at least the one migration criterion of the default VD pool. 18.The non-transitory computer readable medium of 16, wherein at least theone migration criterion comprises a network bandwidth (BW) thresholdvalue.
 19. The non-transitory computer readable medium of 15, whereineach of the plurality of target resource specific pool specificconfiguration templates comprises target resource related VD poolconfiguration parameters.
 20. The non-transitory computer readablemedium of 15, wherein the common configuration template set comprisesnon-target resource related VD pool configuration parameters.